DAMSEL: The DSTO/Macquarie System for Entity-Linking

Authors

  • Matthew Honnibal
  • Robert Dale
Abstract

This paper describes the DSTO/Macquarie University System for Entity Linking (DAMSEL), which competed in the 2009 Text Analysis Conference Knowledge Base Population task. The system achieves 73.5% accuracy. For a given named entity mention, the system retrieves a set of candidate entities from the knowledge base and selects the most likely candidate based on the similarity between the document in which the mention was found and the candidate’s Wikipedia article. The best-performing candidate selection strategy took advantage of Wikipedia redirection and disambiguation data. The best-performing similarity measure was the cosine metric.
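The ranking step described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the tokenizer, the use of raw term frequencies rather than any weighting scheme, and the names `tokenize`, `cosine_similarity`, and `link_mention` are all assumptions made for illustration. Candidate selection (for example, from redirect and disambiguation data) is assumed to have already produced a mapping from candidate identifiers to article text.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphanumeric tokens (assumed tokenizer)."""
    return re.findall(r"[a-z0-9]+", text.lower())


def cosine_similarity(text_a, text_b):
    """Cosine of the angle between the raw term-frequency vectors of two texts."""
    vec_a = Counter(tokenize(text_a))
    vec_b = Counter(tokenize(text_b))
    # Dot product over the terms of text_a; a Counter returns 0 for missing terms.
    dot = sum(count * vec_b[term] for term, count in vec_a.items())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text is treated as similar to nothing
    return dot / (norm_a * norm_b)


def link_mention(document, candidates):
    """Return the id of the candidate whose article is most similar to the
    document containing the mention, or None if there are no candidates.

    `candidates` is a hypothetical dict mapping candidate id -> article text.
    """
    if not candidates:
        return None
    return max(candidates, key=lambda cid: cosine_similarity(document, candidates[cid]))
```

For example, given a document about a senator speaking in Atlanta and two hypothetical candidates for the mention "Georgia", `link_mention` would pick whichever candidate article shares more of the document's vocabulary relative to its length.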


Similar resources

The Effect of Transitive Closure on the Calibration of Logistic Regression for Entity Resolution

This paper describes a series of experiments in using logistic regression machine learning as a method for entity resolution. From these experiments the authors concluded that when a supervised ML algorithm is trained to classify a pair of entity references as linked or not linked pair, the evaluation of the model’s performance should take into account the transitive closure of its pairwise lin...


Estimating the Parameters for Linking Unstandardized References with the Matrix Comparator

This paper discusses recent research on methods for estimating configuration parameters for the Matrix Comparator used for linking unstandardized or heterogeneously standardized references. The matrix comparator computes the aggregate similarity between the tokens (words) in a pair of references. The two most critical parameters for the matrix comparator for obtaining the best linking results a...


Inverse Miniemulsion Method for Synthesis of Gelatin Nanoparticles in Presence of CDI/NHS as a Non-toxic Cross-linking System

In this research, gelatin nanoparticles were synthesized via inverse miniemulsion method by employing a mixture of a water soluble carbodiimide (CDI) and N-hydroxysuccinimide (NHS) as a non-toxic cross-linking system. The gelatin nanoparticles were characterized for their size and size distribution, morphology and stability and were compared with those of nanoparticles cross-linked by glutarald...


The Language Components of DAMSEL: An Embedable Event-driven Declarative Multimedia Specification Language

This paper provides an overview of the three language components of DAMSEL, a framework being implemented at the University of Minnesota. It is comprised of an embedable dynamic multimedia specification language, and supporting execution environments. The goal of DAMSEL is to explore language constructs and execution environments for next-generation interactive multimedia applications. DAMSEL s...


Stochastic Modelling and Computation Importance Sampling Plays a Crucial Role for Quasi-monte Carlo Methods

8:30 – 8:50 Registration 8:50 – 9:00 Opening Remarks 9:00 – 9:30 Mike Hutchinson, Australian National University Locally adaptive gridding of elevation data 9:30 – 10:00 Malcolm Hudson, Macquarie University Block Fisher scoring optimization of penalized likelihoods in emission tomography 10:00 – 10:40 Alex Smola, NICTA Nonparametric tests for distributions 10:40 – 11:00 Morning Tea 11:00 – 11:4...




Publication date: 2009